-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add data validation step #95
Conversation
Thank you!Thank you for your pull request 😃 🤖 This automated message can help you check the rendered files in your submission for clarity. If you have any questions, please feel free to open an issue in {sandpaper}. If you have files that automatically render output (e.g. R Markdown), then you should check for the following:
Rendered Changes🔍 Inspect the changes: https://github.com/epiverse-trace/tutorials-early/compare/md-outputs..md-outputs-PR-95 The following changes were observed in the rendered markdown documents:
What does this mean?If you have source files that require output and figures to be generated (e.g. R Markdown), then it is important to make sure the generated figures and output are reproducible. This output provides a way for you to inspect the output in a diff-friendly manner so that it's easy to see the changes that occur due to new software versions or randomisation. ⏱️ Updated at 2024-09-13 09:08:31 +0000 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the contribution @Degoot-AM!
This complementary content is appropriate to show the whole workflow of {linelist}
: tag, validate, safeguard, and get a tagged-only data frame.
I think we can still work on the arrangement of this content to easily identify these four tasks.
For my edit suggestions, I suggest first incorporating the specific in-line edits (here on GitHub) and then working on the major content rearrangements (locally on our machines by new commits):
- move the tags_df to the end
- add a challenge to test a different saveguard
- hide the tags_types output in a challenge hint to make this useful and not only descriptive
- simplify the showing of the validate functions
Interested to discuss further any of these proposals.
I think the main theme in this episode could be that "we need clean data to allow appropriate tagging and validation before running the analysis" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @Degoot-AM for this modifications. This looks good!
I tested it as a learner. From this, I suggested using the simplest pieces of code to learn about the {linelist} outputs. Also suggested some text edits to make the content more readable. Lastly, I provided my longest edits to the challenges given that we can provide "formative assessments" based on code that could produce these moments of "oh! this has changed!" and then the learning, as is the approach you are following.
Note: some code solutions are in bullets on purpose to allow its writing. If accepted, will be removed as in #106
This is my last request for changes, I promise :D
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Accepting changes proposed by @avallecam.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for accepting the feedback edits. I added some extra commits to clarify some lines I suggested. Now ready to merge 🚀
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
Co-authored-by: Andree Valle Campos <[email protected]>
7a380dd
to
13c0dba
Compare
This PR adds data validation steps using the linelist package, addressing issue #94
Fix #94
Fix #109
Fix #96